Modeling and Optimization of Straggling Mappers

Authors

  • F. Farhat
  • D. Z. Tootaghaj
  • A. Sivasubramaniam
  • M. Kandemir
  • C. R. Das
Abstract

The MapReduce framework is widely used to parallelize batch jobs because it exploits a high degree of multitasking to process them. However, it has been observed that as the number of mappers increases, the map phase can take much longer than expected. This paper shows analytically that the stochastic behavior of mapper nodes has a negative effect on the completion time of a MapReduce job, and that continuously increasing the number of mappers without accurate scheduling can degrade overall performance. We analytically capture the effects of stragglers (delayed mappers) on performance. Based on the response time distribution of mappers, we then model the map phase in terms of hardware, system, and application parameters. The mean sojourn time (MST), the time needed to sync the completed map tasks at one reducer, is formulated mathematically. We then optimize the MST by finding the task inter-arrival time for each mapper node. The optimal mapping problem leads to an equilibrium property, which is investigated for different types of inter-arrival and service time distributions in a heterogeneous datacenter (i.e., a datacenter with different types of nodes). Our experimental results show the performance and the important parameters of different types of schedulers targeting MapReduce applications. We also show that, in the case of mixed deterministic and stochastic schedulers, there is an optimal scheduler that always achieves the lowest MST.
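The straggler effect described in the abstract can be illustrated with a small sketch. It is not the paper's model: it assumes, purely for illustration, that mapper response times are independent and exponentially distributed with the same mean, and that the map phase ends when the slowest mapper finishes. Under that assumption the expected map-phase time is the mean task time multiplied by the harmonic number H_n, so it keeps growing as mappers are added. The function names `expected_map_phase_time` and `simulated_map_phase_time` are hypothetical.

```python
import random


def expected_map_phase_time(n_mappers: int, mean_task_time: float) -> float:
    """Closed form for E[max of n i.i.d. exponential times] = mean * H_n.

    Assumption (not from the paper): exponential, identically distributed
    mapper response times.
    """
    harmonic = sum(1.0 / k for k in range(1, n_mappers + 1))
    return mean_task_time * harmonic


def simulated_map_phase_time(
    n_mappers: int, mean_task_time: float, trials: int = 20000, seed: int = 0
) -> float:
    """Monte Carlo estimate of the same quantity, as a sanity check."""
    rng = random.Random(seed)
    rate = 1.0 / mean_task_time
    total = 0.0
    for _ in range(trials):
        # The map phase waits for the slowest (straggling) mapper.
        total += max(rng.expovariate(rate) for _ in range(n_mappers))
    return total / trials


if __name__ == "__main__":
    for n in (1, 4, 16, 64):
        analytic = expected_map_phase_time(n, 1.0)
        simulated = simulated_map_phase_time(n, 1.0)
        print(f"n={n:3d}  analytic={analytic:.3f}  simulated={simulated:.3f}")
```

The sketch holds the mean task time fixed as the number of mappers grows. A real job would also shrink the work per mapper, so the net effect depends on how the two trends balance, which is the kind of trade-off the paper's scheduling analysis addresses.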


Similar resources

Impact of range straggling and multiple scattering on proton therapy of brain, using a slab head phantom

Background: The advantages of the proton beam in radiation therapy, such as small lateral scattering and the absence of an exit dose tail in the organs beyond the tumor, make it capable of delivering a higher treatment dose to the target and a much lower dose to the critical tissues near it. Materials and Methods: In this study, the Monte Carlo MCNPX code has been used to simulate a slab head phantom ...


Performance comparison of land change modeling techniques for land use projection of arid watersheds

Land use/land cover change is known to be an important driver of environmental alteration, especially in arid and semi-arid areas. This research mainly aimed to assess the validity of two major types of land change modeling techniques via a three-dimensional approach in the Birjand urban watershed, located in an arid climatic region of Iran. Thus, a Markovian approach based on two suit...


Defining Semantic Variations of Diagrammatic Languages Using Behavioral Programming and Queries

We present a methodology for describing the executable semantics of diagrammatic modeling languages, and an execution engine based on such a definition. Under the proposed methodology, languages are defined using a set of pairs, each composed of a query and a group of mappers. The queries, defined over the language’s diagrammatic syntax, return language constructs. These constructs are mapped by the mappers t...


Host sympatry and body size influence parasite straggling rate in a highly connected multihost, multiparasite system

Parasite lineages commonly diverge when host lineages diverge. However, when large clades of hosts and parasites are analyzed, some cases suggest host switching as another major diversification mechanism. The first step in host switching is the appearance of a parasite on an atypical host, or "straggling." We analyze the conditions associated with straggling events. We use five species of colon...


Modeling and Optimization of Industrial Multi-Stage Compressed Air System Using Actual Variable Effectiveness in Hot Regions

In this article, the modeling and optimization of the power consumption of a two-stage compressed air system is investigated. To do so, the two-stage compressed air cycle with intercooler of FAJR Petroleum Company was considered. This cycle includes two centrifugal compressors and a shell-and-tube intercooler. For modeling of power consumption, actual compressor isentropic efficiencies and inter...



Publication date: 2015